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Control scenarios have been identified where the use of randomized design may substantially im- 
prove the performance of dynamical decoupling methods [L. F. Santos and L. Viola, Phys. Rev. 
Lett. 97, 150501 (2006)]. Here, by focusing on the suppression of internal unwanted interactions 
in closed quantum systems, we review and further elaborate on the advantages of randomization 
QQ ' at long evolution times. By way of illustration, special emphasis is devoted to isolated Heisenberg- 

f^ , coupled chains of spin-1/2 particles. In particular, for nearest-neighbor interactions, two types of 

("^ ' decoupling cycles are contrasted: inefficient averaging, whereby the number of control actions in- 

O^ ' creases exponentially with the system size, and efficient averaging associated to a fixed-size control 

» I , group. The latter allows for analytical and numerical studies of efficient decoupling schemes created 

^ I' by exploiting and merging together randomization and deterministic strategies, such as symmetriza- 

.^^ ' tion, concatenation, and cyclic permutations. Notably, sequences capable to remove interactions up 

to third order are explicitly constructed. The consequences of faulty controls are also analyzed. 
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I. INTRODUCTION 

Dynamical decoupling (DD) provides a versatile control-theoretic setting for manipulating the dynamics of closed 
as well as open quantum systems. DD schemes operate by subjecting the system of interest to suitable sequences 
of external control operations, with the purpose of removing or modifying unwanted contributions to the underly- 
ing Hamiltonian. DD methods have a long history in high-resolution nuclear magnetic resonance (NMR) JlUa, y], 
where coherent averaging ideas have been pioneered in the context of removing undesired phase evolution [4. 5* and 
dipolar interactions 6] in spin systems. More recently, DD has emerged as a promising strategy toward achieving 
scalable quantum information processing (QIP), thanks to its potential for protecting logical quantum states against 
always-on qubit-qubit interactions and for suppressing environmental decoherence. The latter possibility was explic- 
itly demonstrated in [7[ - where suppression of decoherence via a sequence of very fast (so-called bang-bang) control 
actions is described for a single qubit interacting with a bosonic environment - and it was soon incorporated within 
a general dynamical symmetrization framework [8|, |9| - whereby the DD operations are drawn from a discrete control 
group so to effectively project out components with unintended symmetry. Since then, DD has become the subject 
of intense theoretical and experimental investigations. On the theoretical side, some notable advances include: the 
construction of bounded-strength Eulerian llQl and concatenated DD protocols [111, [I4I , as well as efBcient combina- 
torial schemes for multipartite systems [ij, [ij, [H, [l^ ; the identification of optimized control sequences capable to 
ensure exact high-order cancellation of pure dephasing in a single qubit [l7J, ll8| ; proposed applications within specific 
(notably, solid-state) scalable quantum computing architectures [19[; quantitative investigations of DD schemes for 
compensating specific decoherence mechanisms, such as magnetic state decoherence in atomic systems [20|, 1// noise 
in superconducting devices [2l|, |23, [23, |2J, [2g] , and hyperfine- as well as phonon- induced decoherence in quantum 
dots 26, 27, 28, 29|, [30, |3l| ■ Within experimental QIP, DD techniques have been successfully applied to decoherence 
control in a single-photon polarization interferometer [32|; have found extensive applications in liquid-state NMR 
QIP [33,], including in conjunction with error-correcting codes [3J]; have inspired charge-based [3^ and flux-based [3^] 
echo experiments in superconducting qubits; and are being scrutinized for further applications in solid-state systems 



such as nuclear quadrupole qubits 37[ and fuUcrene qubits [38 

Even if, in the absence of any control constraint and under appropriate mathematical assumptions, DD techniques 
may guarantee the exact elimination of all the undesired coupling, a main limitation is the fact that, in general, 
such an exact averaging is practically impossible. Residual errors arising from imperfect averaging accumulate in 
time and eventually result in loss of fidelity. In order to slow down error accumulation, randomization may be 
incorporated into DD design, as proposed in [33,[40]- In a sense, this is reminiscent of compensation schemes, which 
are routinely used in NMR spectroscopy to reduce the effects of known errors introduced by non- ideal control [l|, [j] . 
Essentially, randomization aims at compensating for imperfect averaging by enforcing probabilistic error build-up 
at long times, the overall coherent DD action being retained provided the applied control history is appropriately 
recorded [33,[40|- Beside long-time averaging, analytical errors bounds in [3^, [40| identified two other scenarios where 
randomized protocols would be expected to perform better than their deterministic counterparts: first, whenever the 
basic decoupling cycle requires a large number of control operations; and, second, when the interactions to be removed 
are uncertain, for instance unpredictably fluctuating in time. This prompted a series of quantitative studies to validate 



the advantages of randomization in the context of DD [41|, |4j, |43|, |4J, |45|, |4g| , and ultimately added quantum control 
to the list of problems benefiting from stochasticity; a list which already includes diverse phenomena ranging from 
the possibility to maximize weak signals by stochastic resonance [47i|, the idea that chaos may stabilize quantum 
algorithms 48], and yet, more recently, the fact that randomization may be used to benchmark noisy quantum gates 
in QIP [49ij. 

It is the purpose of this work to both comprehensively review and further analyze the advantages of randomized 
coherent-control methods at long evolution times. We focus on the representative case of suppression of internal 
interactions in a time-independent spin-1/2 Heisenberg-coupled system. In order to pinpoint the origin of the advan- 
tages coming from randomization, two scenarios are considered: averaging over an inefficient DD group, whose size 
increases exponentially with the number of spins; and averaging over a small, fixed-size group. The first case allows 
for a numerical comparison between deterministic and randomized schemes as the group size increases. The second 
lends itself to a detailed analytical study of various high-level deterministic schemes, which employ symmetrization, 
concatenation, and cyclic permutations - eventually leading to the identification of best-performing deterministic DD 
scheme. The incorporation and analysis of different randomized strategies to further boost the performance of the 
resulting deterministic schemes is then carried out numerically. 

Ultimately, the key idea for efficient averaging at long times is frequent scrambling of the order of the applied 
DD operations, so that residual errors do not get a chance to rapidly accumulate in time. While this idea is at the 
heart of randomized methods, a natural question is why randomization would have to be invoked in the first place: 
What prevents one from finding an optimal deterministic sequence for a specific system and a particular final time? 
The problem lies in the fact that, due to the rapidly growing number of possible control trajectories associated with 
different sequences as the system size increases, combined with the strong dependence of protocol performance on 
the final time, such search is typically intractable in practice. We illustrate this point by developing a numerical 
algorithm to obtain the best DD sequence under some constraints imposed to the controls. Although very efficient, 
the resulting sequence is still outperformed by a considerably simpler randomized protocol. We therefore advocate 
that, for long evolution times, a much less demanding and yet very efficient decoupling approach consists in cleverly 
combining good deterministic strategies with randomization. 

The content of the paper is organized as follows. In Sec. II, the theoretical framework of DD is briefly recalled, 
as well as the performance metric and interaction frames relevant to the subsequent discussion. Sec. Ill describes 
the deterministic and randomized protocols to be compared and present analytical lower bounds for their expected 
performance under ideal control assumptions. Sec. IV discusses the models to be studied and highlights the main 
control requirements. Focus is given to systems with nearest-neighbor couplings and to the ability to selectively 
address individual spins. In Sec. V, we compare how the performance of deterministic and randomized schemes 
depend on the size of the DD group. Schemes involving a large degree of parallelism emerge as best performers, 
consistent with intuition. The core of the paper is contained in Sec. VI. There, we present analytical studies for 
deterministic protocols; introduce a new deterministic sequence; compare numerically deterministic and randomized 
schemes, as well as different venues for including randomization; discuss the results obtained with different systems; 
and propose an algorithm to search for efficient DD sequences. A comparison of deterministic vs. randomized DD 
protocols would not be complete without the inclusion of some dominant control errors, which is done in Sec. VII. 
Conclusions and discussions are provided in Sec. VIII. Technical considerations are left for the Appendix. 

II. DYNAMICAL DECOUPLING FRAMEWORK 
A. Control setting 

As mentioned, DD methods have long been applied in NMR spectroscopy [l|, 0, S 1^] j where the aim is to modify 
the nuclear spin Hamiltonian to suppress or scale selected internal interactions. More recently, DD has been revisited 
in the light of quantum control theory, by also explicitly addressing, in particular, the removal of interactions between 
the system of interest and the surrounding environment 0, [a]. In both cases, the basic idea consists in adding 
an appropriate time-dependent control field Hc{t) to the Hamiltonian Hq of the relevant target system. In the 
physical (Schrodinger) frame, the evolution operator under the total Hamiltonian H{t) = Hq + Hc{t) becomes U{t) = 
Texp[— i/p H{u)du], where h is set equal to 1 and T denotes time ordering. Most commonly, the analysis of DD 
methods is performed in a logical frame (also known as "toggling frame" in the NMR literature), which corresponds 
to a time-dependent interaction representation that follows the applied control. In this frame, the Hamiltonian is 
written as 

HQ{t)^UUt)HoUc{t), (1) 



where Uc{t) = Texp[— i Jp Hc{u)du] is the control propagator at time t, and the logical evolution operator becomes 



U{t)^uKt)Uit)^T 
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In this work, we shall focus on an isolated (closed) finite-dimensional system S, controlled through a sequence of 
equally spaced control pulses, Pfc, applied at times i/j, A: G iNl (to = 0). The pulses average out the effects of unwanted 
interactions by repeatedly rotating the system and undoing its internal (drift) evolution. In the limiting situation of 
arbitrarily strong and instantaneous pulses - the above-mentioned bang-bang setting [7] - the evolution during the 
pulses depends only on the control Hamiltonian, whereas during the intervals Ai = t^ — ifc-i, the system evolves 
freely according to i/o- The propagator at i„ = nAi, n S N, then reads 

U{tn) = P„C/(i„,t„_i)P„_iC/(i„_i,t„_2)...PiC/(ii,0)Po 

- (P„P„_l...PlPo)(i'n-l-..Pl^o)tC/(tn,i«-l)(P„-l...PlPo)...C/(i2,tl)(PlPo)n^C/(ti,0)Po • (3) 

USn) U{tn) 

The design of multi-pulse sequences is based on the desired form of the effective propagator at a final evolution 
time T > 0. To derive the time evolution operator, different methods have been employed, including Fer's expansion^ 
which gives an exponential infinite-product expansion oiU{T) |50l.l5lLl52Ll53l|. and average Hamiltonian theory (AHT), 
which makes use of the Magnus expansion to represent U{T) in terms of a single exponential [H, I4I. Since the latter 
will be the main tool considered here, it is briefly described next. 

We begin by writing the logical propagator at an arbitrary instant i„ in terms of a single exponential. From Eq. p|. 



iP^HoPoAt 



U{tn) = exp [-i{Pn-i . . . Po)^Ho{Pn-i ■ ■ ■ Po)At] . . . cxp [-i{PiPo)^Ho{PiPo)At] exp 

= exp [—iH„At] . . . exp [—iH2At] cxp [—iHiAt] , 
= exp[-ii/e//(i„)i„] , 

where in the second line we have used the notation Hn = {Pn-i ■ ■ ■ Pq)'' Ho{Pn-i ■ ■ ■ Pq) for the transformed Hamil- 
tonians during a given segment of evolution, and the Magnus expansion (or Baker-Campbell-Hausdorff expansion, 
since the Hamiltonian is piecewise constant in time) [1,2, 54] to obtain the last equality. In the explicit expression of 
the effective Hamiltonian Heff(tn) = '^'kLo-^^'^K'tn), each term H'^^^{tn) is proportional to (Ai)'^/i„ and involves k 
time-ordered commutators of transformed Hamiltonians. 

The convergence of the Magnus expansion depends str ong ly on the representation considered, and examples exist 
in the literature of its failure at long times, see e.g. [55|. Explicit evaluations of the convergence radius have 
been obtained for specific systems, in particular for a two- level system [55|, l56l |57|, |58|, |59|, while general sufficient 
conditions for the absolute convergence of the expansion have been recently established, see e.g. [60] and references 
therein. Interestingly, it has also been shown that by connecting the Magnus expansion with rooted trees, a recursive 
procedure to generate the expansion terms and a convergenceproof become available |61i.i62]. For the current purposes, 
the condition kTc < 1, where k ^ |l-ffo|l2 = max [eig(iJo)] [H, 0) 123, l55[ , shall be used as a guideline for the Magnus 
series convergence. 

So far, no special assumptions have been made in regard to the control field, which may, in principle, have either 
deterministic or non-deterministic features. While specific protocols within each setting will be described in Sec. HI, 
the essential difference between deterministic and randomized design is that in the latter case the future control path is 
not known, but rather, in the simplest case, effect a suitable random walk [39|. In the particular case of a deterministic 
time-dependent perturbation which is cyclic, that is, when the control Hamiltonian and the control propagator are 
periodic with cycle time Tc, Hc{t + nTc) = Hdt) and Uc{t + nTc) — Uc{t), it follows from Eqs. (P)-© that the logical 
Hamiltonian is also periodic, and Heff{Tn) — H for any !"„ — nTc- At these instants, the system in the logical frame 
appears to evolve under a time-independent average Hamiltonian H = X]fc°=o ^ i the resulting propagator being 



U{nTc) = UiTj' = e-'"''^" . 

Accordingly, describing the system at any multiple integer of Tc only requires the computation of the system's evolution 
after a single cycle. This constitutes the main result of AHT, and is also directly applicable to the physical frame: It 
follows from Uc{Q) = 1 that UdnTc) = 1, which leads to the stroboscopic overlap of physical and logical frames at 
T„, U{nT,) = UinT,). 

For the deterministic DD sequences of relevance to this work, the first term of the average Hamiltonian H , namely 
H^'^' , may be cast in terms of a group-theoretic average [8|. In this case, control pulses are successively drawn from 



a (projective) representation of a finite DD group Q — {gj}, j = 0, . . . ,\Q\ — 1, with \Q\ giving the order of the group. 
The propagator after a control cycle, t — Tc — \Q\At, is written as 

lei-i 



j=o 



where 



Uj+i = g]U{tj+i,tj)gj, Hj+i = g]Hagj, Pj+i = gj+ig], Pq = go , (5) 

The zeroth, first, and second order terms of the Magnus expansion are now respectively given by 

'^ k=l 

\g\ i-i 



^'^' = -^EE[^''^^]' 



2T, 

\3 



1=2 fe=l 

\Q\ m-l i-1 -, n l-l 



^'" ^ ~&r\ E E E {t^-' [^'' ^'^■1 + t^"- HilHk]} + 2 E E {[^'' [^'' ^^-l + [Hi,Hk],Hk]} \. 

^ ^ m=3 1=2 fe=l 1=2 fe=l ^ 

In designing a DD scheme, one seeks an appropriate DD group Q which may 'reshape' the target Hamiltonian as 
desired. The primary goal is to tackle the dominant term H^^' , whose modification may be sufficient in the ideal limit 
of Tc — > or when dealing with very short evolution times. However, in realistic settings, and especially when long 
evolution times are involved, as in the current work, the role of higher order terms becomes critical, and strategies 
to reduce their effects are imperative. Among the various options, we shall discuss symmetrization, concatenation, 
cyclic permutation, and randomization. 

B. Performance metric 

Our main control objective in this work will be to achieve a 'no-op' gate or, in NMR terminology, a time suspension 
- that is, to freeze the system by completely refocusing the Hamiltonian evolution and making IJ{T) as close as 
possible to the identity, 1, for a desired finite time T. One way to quantify how successfully such objective is achieved 
relies on quantifying the input-output fidelity in the logical frame, 

Fp{T) = Tr[p(r)p(0)] , (6) 

where p{Q) is an arbitrary initial state of the system and p{T) = lJ{T)p{Q)lj\T). Arbitrary state preservation 
corresponds to the maximum value Fp{T) = 1. For a pure initial state lip), the above fidelity rewrites as 

F^^^iT)^MUiTm\'- (7) 

A disadvantage associated with i^|0^(r) is its intrinsic state dependence, a characteristic not suitable for a metric 
intended to assess dynamical protocol performance. In this sense, it would be more appropriate to invoke the pure 
state that leads to the worst-case pure state fidelity [S^l- However, the drawback associated with this option is 
practical unfeasibility, since searching for the worst {tp) is not operational, except for very small systems. Obtaining 
a control metric which is at same time state-independent and efficiently computable is possible by shifting attention 
from worst-case to typical input-state performance, as captured by so-called entanglement fidelity, Fg 63] . 

Entanglement fidelity is defined with respect to an initial entangled state \ip^^^) of the system S and a reference 
system R as Fe{p^ ,£^) = Tr{\%l;^^){ilj^^\p^^ }, where p^^ is the final state subjected to the evolution l''^ (g) £^ . By 
using the operator-sum representation, 8^ = X^u ^f /^^^f ^ ^e niay be written in terms of quantities of the system 
only, Fe{p^,£^) — J2n |Tr{p'^^u}P- For a closed system, Af^ = U, and the entanglement fidelity associated with a 
(any, in fact) maximally entangled purification |^^'') - thereby a maximally mixed state for S, p = 1 /d - assumes 
the simple form Fe{T) = lTr[[/(T)]/(ip ^], where d is the dimension of the system state space. Protocol performance 
may then be evaluated solely in terms of the system propagator. It is worth noting that a linear relationship exists 



between Fp and the average fidelity F over all possible initial pure states, F — {dFe + l)/{d+l), as formally established 
inRefs. ^^ 

By its own nature, randomized DD methods involve various control realizations, each leading to a different value 
of fidelity. Thus, control performance in this case is estimated in terms of an appropriate statistical average over 
individual results. In the logical frame, denoting by E the ensemble expectation over all control realizations, the 
expected entanglement fidelity is given by 

E{F,{T)} = E{\TY[U{T)]/d\^}. (8) 

Complete refocusing then translates into achieving E{Fe(T)} ^1. In the numerical Monte Carlo studies performed 
in what follows, ensemble expectation is replaced by the more viable statistical average over a sufficiently large sample 
of control realizations, which we designate by {{Fe{T))). 

C. Logical vs physical frame 

The logical representation is a convenient theoretical tool used to facilitate the design of pulse sequences. However, 
experiments are performed in the physical frame, so this is where our specific control objective need to be achieved. 
When dealing with periodic sequences, these differences are disregarded, since measurements are usually performed 
at the end of a cycle, where the two frames coincide. However, if one decides to observe the system in between cycles 
or deals with acyclic pulse sequences (as it is inevitably the case in randomized DD), correcting pulses Pc{t) may be 
required to guarantee the final desired effect in the physical frame. It becomes then necessary to keep track of the 
applied pulses, because Pc{tn) sX an arbitrary i„ is determined by the control propagator Uc{tn) as 

Pc{tn) = f/c(t„)^ = {PnPn-1 ■ ■ ■ ^2^1/^0)^ (9) 

Consider, for example, the case of quantum information storage. Restricting ourselves to achieving U{T) ^ 1 is 
equivalent, from the physical frame perspective, to assuring that the system evolution is dictated only by the control, 
U{T) — » Uc(T). This is clearly reflected in Eq. ©, which, by using Eq. ^, may be rewritten as 

F,{T) = Tr[p{T)p{0)] = Tr[p(T)p,(r)], (10) 

where p{T) = U{T)p{0)W{T) and pc{T) = Uc{T)p{0)U^{T). Thus, to freeze the system in the physical frame 
also, conditional to a given control history, we need a correcting pulse that un-does Uc(T), so that upon correction 
Fp{T) = Tr[p(T)p(0)] -^ 1, as desired. 

Notice that in quantum information storage, frame correction and signal acquisition are performed only once, at 
the final time T. However, when data need to be constantly acquired, such as in standard line-narrowing NMR 
spectroscopy experiments, frequently applying frame-correcting pulses may be experimentally demanding, besides 
introducing additional errors. In such cases, it may be worth designing control schemes which need not be cyclic, but 
already incorporate appropriate "observation windows" - an example is given in Sec. III.C. 

III. DYNAMICAL DECOUPLING DESIGN 

In this section, we outline several deterministic and randomized DD schemes and discuss lower bounds for their 
attainable fidelity. Better performance depends on the protocol capabilities to increase averaging accuracy in the 
effective Hamiltonian and to slow down the accumulation of residual averaging errors. Symmetrization, concatenation, 
cyclic permutations, and randomization are the key design principles exploited to generate efficient schemes. 

A. Deterministic protocols 

We shall assume that the first group element for deterministic protocols is always go — 1, or equivalently, that the 
first pulse occurs only after an initial time delay At. 

(i) The simplest deterministic protocol is a cyclic scheme based on a fixed, pre- determined control path of a specific 
representation of Q, leading to ffi'st-order decoupling, _ff*^°^ = 0. Any such scheme is referred to as periodic DD (PDD). 
Following Eq. (J4]), the logical propagator for PDD at Tc is given by 

U^Tc) = (^g]g^_,U{\g\At, i\g\ - l)Ai)g|e|-i) . . . (glu^SAt, 2At)g2) (fflc/(2Ai, Ai)gi) (<?tc/(At, 0)50 



which we compactly write as 

f7pDD(rc)-[C/|e|...C/3C/2(7i]. 

From now on, Tc will always refer to the cycle time of the PDD sequence. Our goal, however, is to push beyond PDD, 
by designing deterministic protocols able to eliminate and/or reduce higher-order terms in the average Hamiltonian. 
Three main strategies are considered: 

(ii) In analogy with the well-known Carr-Purcell sequence of NMR [5| , we may time-symmetrize the PDD control 
path. This leads to what we call symmetric deterministic DD (SDD). The cycle becomes twice as long, T^°° = 2Tc, 
but all odd order terms in Hq are also canceled 1, 2]. In compact notation, the propagator becomes 

f>SDD(2T,)= [UiU2U3...U\g\] [U\g\...U3U2Ui]. 

[sym] 

(iii) In concatenated DD (CDD), the basic PDD sequence works as a "seed" which is being recursively embedded 
within itself, as formalized in [111, [l3]- At level of concatenation {£ + 1), the pulse sequence in the physical frame is 
determined by Q+i — CiPiCiP2 ■ ■ ■ CiP\g\, where Co denotes the interval of free evolution and Ci is the generating 
inner PDD sequence. Level {£ + 1) is then reached at time T = \Q\^Tc. In terms of group elements, since go = 1- we 
may write 

Note that at ^ = 2, the above concatenated sequence is also symmetric, but, interestingly, it may outperform SDD 
even before this level of concatenation is actually completed, as analytically justified for the system considered here 
in Sec. VI. A. 3. This reflects CDD efficiency in reducing the effects of higher order terms in the effective Hamiltonian. 

Notice that if data is acquired before the completion of a given concatenation level, correcting pulses may be 
required to compensate for frame mismatch. Besides, CDD design is not cyclic. A periodic (or supercycle) version may 
be obtained by truncating the scheme at a certain level £, and then periodically repeating it at every T — n\Qf~^Tc 
- this protocol is denoted PCDDf. 

(iv) Yet another alternative is inspired by the Malcolm Levitt's (MLEV) broadband decoupling sequence used in 
high-resolution NMR |67l. l68l. l69l|. which will be referred to as symmetric cyclic permutation based DD (SCPD). This 
pulse sequence combines symmetrization and cyclic permutations of the group elements in the following way. At what 
we call first level, m=l, SCPD and SDD coincide. The cyclic permutations initiate at level 2, being restricted to the 
PDD part of the sequence as 

UscpbMG\Tc) = [sym][f/i[/|g| . . . C/3C/2] . . . [syni][C/|g|_i . . . U3U2UiU\g\] [sym][U\g\ . . . U3U2U1] , 



^lei A2 Ai 

From the third level on, the sequence for m+1 is based on permutations of the entire sequence obtained at m, being 
concluded at T = 2\Q\"^Tc. Following this rule, at m=3 we have 

UsC?uA'^\G?T^) ^ [AlA^gl . . . A3A2] ... [A|g|_i...^3A2Aiyl|c;|] [A^g^ . . . A3A2A1] . 

B\g\ B2 Bi 

Similarly to PCDD^, PSCPD,„ corresponds to a SCPD sequence truncated at level m and periodically repeated at every 
T = 2n\g\'^~^Tc. 

A main disadvantage of periodically repeated sequences is that residual errors due to the higher-order terms in H 
accumulate coherently. However, this build-up slows down if the path to traverse Q is constantly being changed, as 
indeed happens in both CDD and SCPD. This strategy is pushed to its limits by the use of randomization, as described 
next. 

B. Randomized protocols 

(i) The most straightforward randomized DD protocol is obtained by picking elements uniformly at random over Q 
(notice that the relevant Haar measure is simply given by 1/|^| in our discrete setting), such that the control action 



8 

at each t„ = nAi {to = included) corresponds to P^""^ = gig], where i,j — 0, . . . ,\Q\ — 1. This leads to the so-called 
naive random decoupling (NRD) - an intrinsically acyclic method, which therefore prevents the direct use of AHT. The 
logical propagator at T = nAt for each of the \G\" possible realizations is 

Ui}KD{nAt) ^[Ur^.. .Ur^Ur^], whcrc ri,r2, . . . ,r„ e i? and R ^ {1,2, . . . ,\Q\}. 

Comparing the two basic deterministic and randomized schemes, PDD and NRD, the first is expected to perform better 
at short times, because it leads to H^'^'> — 0, whereas no guarantee exists of achieving Heff{n\Q\At) oc At with NRD. 
On the other hand, at long evolution times, NRD is expected to outperform PDD, since it accumulates residual averaging 
errors more slowly. To ensure good performance at both short and long times it is then natural to seek for ways to 
merge advantageous deterministic and stochastic features in a single DD scheme. With this goal in mind, we now 
describe several high-level alternatives for randomized protocols, which may be thought as involving different choices 
for an "inner" and an "outer" control code ^44,]. The inner code establishes the pulse sequence to be employed in 
certain intervals of the total final time and aims at increasing the minimum power of At in the effective Hamiltonian, 
thereby improving short-time performance. The outer code determines the random pulses applied at the borders of 
such intervals, with the objective of slowing down error accumulation. 

(ii) A natural option corresponds to combining a fixed PDD sequence used in the interval [n, {n + l)\\Q\At with 
random pulses P^"^' at r„ = n|tJ|Af. The bordering pulses may or may not be drawn from the same group Q . In the 
first case, embedded DD 1 (EMDi), the logical propagator at T = \Q\At for each of the \Q\ possible realizations is 

UmT>A\Q\^t) = g]P\g\ ■ ■ ■ U3U2Ui]gj. 

As an example of the second case, we mention the protocol implemented in [4l|, here called EMD2. The inner sequence 
corresponds to a PDD based on a certain group Q, while the bordering pulses are drawn uniformly at random from 
the irreducible Pauli group Q^ = ®iQi, where Qi = {!,, X^, Yi, Zi\, i — 1,2, . . . ,N, N is the total number of two-level 
systems i, and Xi, Yi, Zi are the Pauli matrices associated with each i. At T = |5| Ai, there are 4^ possible realizations 
and the propagator for each one is given by 

c/EMD,(|^|At) = .gt[c/|g| . . . c/3C/2C/i]<?p, ffp e e^ . 

Since the number of realizations in EMD2 is usually much larger than in EMDi, error accumulation in the former is 
slower. In practice, however, situations may be encountered where control capabilities restrict protocol design to a 
single group. 

(iii) The use of the PDD sequence as the inner code guarantees only that an effective Hamiltonian with norm of 
0{At) is obtained. To ensure higher powers in At, we may embed with random pulses higher-level deterministic 
protocols, such as SDD, PCDD^, and PSCPD,„, which lead to schemes respectively denoted here by ESDD, EPCDD^, and 
EPSCPDm. 

(iv) Another disadvantage of having a PDD sequence as the inner code is the fact that its performance may vary 
significantly depending on the specific path chosen to traverse Q. In cases where searching for the best option is 
costly, such as when Q is large, a better alternative consists in randomly choo sing at every T„ = n\Q\At a control 
path to traverse the group, leading to so-called random path DD (RPD) [39, 44, i4^. This scheme becomes yet more 
promising if the random paths are symmetrized in the same manner as in SDD, leading to symmetric random path DD 
(SRPD) [43|, |4J, I43] . The logical propagator at T = 2|C/|At for each of the \Q\\ realizations is then given by 

UswD{2\Q\At) = [sym][f7^|g| . . .Us^Us^Us^], si e R, S2 e R- {si}, ..., S|g;| e R- {si, 82,83,. .. ,S|g;|_i} 

Since randomized protocols are intrinsically acyclic, correcting pulses are usually necessary before acquiring data. 
To avoid them, schemes which, as mentioned, may already contain suitable observation windows may be designed. 
As an example, we mention a pseudo-RPD: in this case, path randomization is restricted by the condition of having 
Uq at every interval [n|5|Ai, {n\G\ + l)At], which ensures that physical and logical frame then coincide. 

C. Performance lower bounds 

Analytical bounds on the expected fidelity decay offer insight on relative strengths and weaknesses of the proposed 
DD schemes. Here, we both review existing error bounds and extend them to some of the new protocols of interest. 

In the limit of sufficiently short time, H-ffolbr < 1, following Refs. ^, |41| and expanding Eq. ([7]) to second order 
in T, the evolution in the logical frame of the fidelity for periodic DD may be written as 



An upper bound for the square of the residual interaction {AH)"^ = {ip\H'^\tp) — {ip\H\iJ;)'^ is given by the norm of -ff as 
{AH)'^ < \\H\\2- In addition, the norm of the average Hamiltonian may also be bounded by ||^||2 < S?!Lo '^i'^'^cY , 
which finally leads to 

F|v,)(T)>i-(f;«(«r,)^") T'. 

\ i=o ^ 

A major factor influencing the performance of a deterministic protocol is its ability to suppress dominant terms in H. 
Assuming that the convergence condition kTc < 1 is satisfied, and recalling the linear relation between F and Ff,, we 
infer the following properties: 

• PDD cancels iJ^"), therefore ||i?||2 < k'^Tc/{1 - kTc). The limit \\H\\2T < 1 implies k^T^T < 1 - kT^, which then 
leads to FeiT) > 1 - OiK^i\g\At)^T^). 

• SDD cancels H'^°^ and H'-^\ thus ||i7||2 < k^T^/{1 - nTc), thereby Fe(T) > 1 - C'(K*'(|5|At)'*T2). 

The derivation of lower bounds for the performance of CDD [SCPD] is not straightforward, depending on three 
elements: the level of concatenation [permutation], the model system, and the decoupling group considered. This is 
better discussed in Sec. VI, where the dominant terms of H are explicitly computed for some particular models. Here, 
we simply mention that when compared to PDD and SDD, PCDD {i > 1) and PSCPD (m > 1) are usually more efficient 
in reducing higher-order terms in the average Hamiltonian. 

Contrasted with periodic methods, where residual errors due to higher order terms in H build up coherently (hence 
quadratically in time), the fidelity for random protocols decays linearly in time. This may be justified as follows. 

• Each step of NRD can accumulate an error amplitude up to kAI, and during a time T there are T / At such 
intervals. Due to randomization, amplitudes add up probabilistically, which leads to E{Fe{T)} > 1 — O(K^AtT). 
The formal derivation of this bound in the limit of K^AtT <C 1 is presented in [391 ■ 

The reasoning is similar for the other protocols, although now each step corresponds to the interval g|C/|At, where 
q = 1 for EMDi, EMD2, and RPD, and g = 2 for SRPD. The bound becomes £{Fe{T)} > 1 - 0{\\Heff\\l\g\AtT), the 
norm of the effective Hamiltonian being an important difference between protocols. 

• EMDi, EMD2, and RPD lead to E{Fe{T)} > 1 - 0{K*{\g\AtfT). 

• SRPD gives £{Fe(T)} > I - 0{K^{\g\AtfT). The same lower bound holds for ESDD, EPCDD^>i, and EPSCPDm, 
although for the last two protocols averaging may be significantly better. 

In general, based on the above estimates, we then expect randomized methods to outperform their deterministic 
counterparts at long times. For T > {n'^lGl'^At)''^, we should eventually have E{F™(T)} > F™(r), while T > 
(K^l^jAt)-! leads to E{Fe™°/^''°(T)} > Ff™(r). However, in order to quantitatively compare SRPD with CDD, EPCDD, 
SCPD, and EPSCPD, we need to specify the model in more detail. Notice that NRD is the only protocol showing no 
dependence on the group size, which makes it a method of choice in cases where |^| is very large. 

IV. MODEL SYSTEM AND CONTROL REQUIREMENTS 

A. Model system 

We consider a chain with N strongly coupled spin-1/2 particles (qubits) described by the Heisenberg model, that 
is, the internal drift Hamiltonian in the physical frame reads 

N (z) N 

i?o = Hz + H.nt = E^+E E J^f-^^-f^ (11) 

i—1 i<j a—x,y,z 

where a'^"'' = a^^'"^'^' = X,Y,Z are the Pauli operators, uji is the Zeeman splitting (Larmor frequency) of spin i as 
determined by a static magnetic field in the z direction, and J^" is the coupling parameter between spins i,j in the 
a direction. Open boundary conditions are assumed. 
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To illustrate the benefits of randomization, we concentrate on the simple case of homogeneous nearest-neighbor 
(NN) coupUngs, for which very efficient DD schemes exist (see Sec. VI). By assuming J^- = Jf- = J and J?- — aJ, 
where a is the coupling anisotropy associated with the Ising contribution, we thus have: 

N N~l 

Hnn ^Y.^+Y.'^ [X^X,+^ + Y,Y,+i + aZ,Z,+{\ , (12) 

1=1 i=l 

This Hamiltonian is used to model quasi-one-dimensional magnetic compounds [7C/| and Josephson-junction-arrays |7ll . 
|72| | . It is also a fairly good approximation for couplings which decay exponentially with the qubit distance - as arising, 
for instance, in semiconductor quantum dot arrays [73|, or which decay cubically - as in dipolarly coupled solid or 
liquid- crystal NMR spin systems [l|, y, uM ^"^^ electrons floating on Helium [73, UM ■ 

Whenever qualitatively different, we shall compare the results associated with the above Hnn with those obtained 
from cubically decaying interactions as approximated by the following Hamiltonian 



N r:, N 



Hcub - Y -^ + H -^ 



i<j 



Ji-iJi-j 'T' J^i^j I O^^i^j 



u 



(13) 



Although neglected here, an additional dependence on the angle between the vector joining spin pairs and the external 
magnetic field is present in principle in the secular dipole-dipole coupling parameter of NMR spin systems [l|, y| . 

B. Control requirements 

In order to suppress the interactions in Hamiltonians p2p and (J13p , we shall assume the ability to apply sequences 
of selective pulses, that is, control pulses that affect only some intended (subset of) spins. This is to be contrasted with 
non-selective (or collective) pulse sequences, which affect all qubits uniformly. A well-known example of the latter is 
the so-called WAHUHA (or WHH-4) sequence developed by Waugh, Huber, and Haeberlen [73| to suppress direct 
dipole-dipole couplings. A quantitative analysis of randomized versions of this sequence, which may have implications 
for solid-state NMR QIP, is left for a separate investigation. 

Besides selectivity, another important feature of control pulses is the rotation angle they effect. Let us assume 
that, as it is the case in typical spin resonance experiments, the system couples to an oscillating control field linearly 
polarized in the x direction according to 



Hcit) = 2n{t) cos[ujft + ip{t)] Y 



^ X,. 



i=i 



where the amplitude (power) 2fl, the carrier frequency u>f and phase tp, as well as the interval r during which Hc(t) 
is on, and the separation At between successive pulses are under experimental control. The field is rapidly switched 
on and off so that Q{t) may be approximated by a piecewise constant. In the rotating frame of the carrier, which 
rotates with frequency ujf, the effective total Hamiltonian is given by 

H^it) = U''\t)(Ho+H,{t) -^Y^zAu'^it), C/«(i) =exp f -zc^t^^ 

^ i ^ ^ i 

The interaction part of the Hamiltonian is invariant under this transformation, but, upon invoking the rotating wave 
approximation 1], the linear terms and the control Hamiltonian become 



i—1 i—1 i—1 



-^cosip{t) + ^smip(t) 



From the above equations, we see that a given spin i is rotated when the control field is applied on resonance with its 
frequency, ujf w Wi (that is, the detuning A^ « 0). The phase ip{t) then determines the direction around which the 
rotation is realized in the rotating frame, and, in the case of rectangular pulses, f2r characterizes the rotation angle. 
For instance, a pulse with u!f — u!2, 'Pit) = 0, and fir = tt flips spin 2 by 180° around the x-axis. Here, the systems 
described by Eqs. (|12p and P^ will be subjected to sequences of 7r-pulses, while for instance the above-mentioned 
WAHUHA sequence involves 7r/2-pulses. 
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All the analyses developed in this paper are performed in the rotating frame. The spins in the systems of interest 
are assumed to be addressable in frequency or by some other means, thereby the possibility of using selective pulses. 
Additionally, the differences \ujj — LUi\ are assumed not to be much larger than the qubit-qubit coupling strength J, 
so that pulse sequences involving rotations around more than a single axis are required. If indeed \ujj — LOi\ 3> J, 
the secular approximation leads to a truncated Hamiltonian where only terms in the z direction remain [3|. In this 
situation, DD may be effected by only using rotations around a single axis perpendicular to z 0, Q . In the case 
of nuclear spin-1/2 Hamiltonians, this means that we are not interested in heteronuclear systems, since the Larmor 
frequencies of two different nuclear isotopes are separated by several MHz, while the couplings are of the order of 
tens or hundreds of Hz. Instead, our analysis has direct implications for homonuclear systems, where the spins are 
differentiated by their chemical shifts 5i , and \Sj—6i\ > J. Chemical shifts emerge from the presence of electrons, which 
generate different small magnetic fields at different sites and cause variations of the net magnetic field experienced by 
the nuclei; the spin frequencies loq of the isotopes are then shifted as tOi ^ luq + 6i. 

In short, we shall focus our analysis on the effects of DD sequences with multiple axes of rotation and selective 
pulses applied to the following rotating-frame Hamiltonians: 

HnN ~ -f^int , (14) 

Hz+NN ~ 2^ -^ + ^int , (15) 

i 

where in both cases Hint is given by the bilinear-NN- interaction terms described in Eq. (|12p , and the two cases differ 
by the explicit inclusion of linear (chemical-shift) contributions. A comparison between the results for H^j^ and those 
for the cubic-decay couplings of Eq. P^ . H^f^, will also be provided. 

V. RANDOMIZATION OVER INEFFICIENT DECOUPLING GROUPS 

We begin by assessing the advantages of randomization in the case of an inefficient DD group, that is, a group 
whose size increases exponentially with the number of qubits. In an appropriate logical-rotating frame, the system 
we consider is described by H^j^^ GM- Since Qk — {Ifc, Zj,, Xj., Yfc} leads to PDD sequences capable of refocusing 

"■fc-i ■ o'fc and aj^' ■ crj^_li, it is straightforward to see that Q = ®kQk, with fc = 2, 4, . . . , 2m and to G N, may be used 
to obtain PDD sequences which decouple up to N qubits, where N = 2m or N — 2m + 1 @, [lj|- When A^ = 4 or 5, 
for instance, a possible DD scheme may be visualized in terms of the following matrix, 

.IZXYYXZl 1 Z X Y Y X Z 1 
'll 1 IZZZZXXXXYYYY 

where each row corresponds to an even qubit and each column, supplemented with the identity operators associated 
to the odd qubits, leads to an element of the group, so that Q — {gj}, j — 0, . . . ,\G\ — ^, with gj = 1^ (g) [M(i j+i)]2 (8) 
I3 €5 [M(2j+i)]4 (8) I5. The proposed DD group requires 4™ 7r-pulses to close a single PDD cycle. Any path taken 
to traverse G leads to first-order decoupling, however notice that a sequence arranged as in M has the property of 
avoiding simultaneous rotations '14*. Contrary to that, a path as in 

, _flZXYlZXY 1 Z X Y 1 Z X Y 
'11 1 IZZZZXXXXYYYY 

for example, leads to simultaneous rotations at every t — An At, n e N. Among all PDD sequences derived from the 
above group G, a very small subset consists of sequences involving only single-qubit rotations; as \G\ increases, most 
paths have in fact a large number of control actions involving simultaneous rotations on several qubits at a time. 

In the NRD protocol, which is based on uniform randomization over G, possible control operations range from the 
total absence of rotations (the identity operator) to collective rotations on m qubits at once. In large systems, the 
fraction of pulses corresponding to extreme cases is very small. Let Qr = 3^m\/[r\{'m — r)!] denote the total number 
of random pulses leading to r simultaneous rotations for a given number m of even qubits, where r = 0, 1, . . . , ?n, and 
S^o Qr = 4™. On the one hand, the percentage of pulses associated with a single qubit rotation, r — I, and with 
the maximum number of rotations, r = m, decreases with the size of the system - as 37TT./4™ and (3/4)™, respectively. 
On the other hand, the degree of parallelism increases significantly with \G\- Given m, the largest Qr is obtained for 
r in the interval [(3m — l)/4, {3m + 3)/4] when to, 7^ 3 + 4n, whereas for m = 3 + An, both values, (3m — l)/4 and 
(3to, + 3)/4, lead to sets of equal size. This means that, for large m, the largest set of random pulses involves rotations 
on roughly 75% of the even qubits. 
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FIG. 1: PDD vs NRD based on a nested pulse sequence for i/^jv (|14ll with a = 1. Top panels: A'^ = 6, \G\ ~ 4^- Bottom panels: 
N — 8, \G\ = 4*- Left panels: Ensemble-averaged entanglement fidelity at T„ — n\Q\At; At = 0.8J^^ /\G\. The numbers 1, 2, 
3, and 4 stand for PDDi, PDD2, PDD3, and PDD4, respectively. Free evolution: (black) oscillating solid line. Right panels: {(iv )) 
at t„ = nAt < Tc; Tc = 1.28J~^. Average over 10^ realizations. 



Whenever a high degree of parallehsm is afforded, more efficient DD schemes exist where the total number of pulses 
needed to close a PDD cycle is significantly reduced (see Sec. VI). However, the interest in the inefficient averaging 
schemes analyzed here lies on the possibility to contrast the effects of single rotations versus simultaneous rotations, 
and to study DD under large control groups, while avoiding computationally intractable system sizes. In Fig.[Tl results 
on the decay of the ensemble-averaged entanglement fidelity in the rotating-logical frame, {{F^)), are shown. We 
consider A^ = 6 (A^ = 8) qubits in the top (bottom) panels, which leads to a relatively large control cycle: 64 pulses 
(256 pulses). In each column, both top and bottom panels have the same value of Tc- We compare NRD with different 
PDD sequences: PDDi - based on the path given by M; PDD2 ~ based on the M' path; and PDD3, which corresponds to 
a particular path selected at random and repeated at every T„ = nTc. The beginning of the PDD3 sequence used on 
the bottom panel is equal to the PDD3 from the top panel. Another randomly selected path without this constraint is 
also considered in the case of A'^ = 8 and is referred to as PDD4. 

When designing a PDD sequence, it is natural to start with straightforward structures such as those given by M or 
M'. However, they are not necessarily the best options. In the left panels, fidelity is computed at every T„ = n\Q\At. 
By contrasting top and bottom panels, we verify that the performance of NRD improves significantly as \G\ increases, 
while PDDi and PDD2 remain essentially unchanged. This explains the crossing between these two curves and the 
randomized protocol in the case of \G\ — 256. The strong enhancement resulting from parallelism becomes then 
evident and suggests that better deterministic sequences ought to exist. In this sense, the selection of an efficient PDD 
sequence is a posteriori motivated by the study of a stochastic scheme. In fact, PDD3 and PDD4 offer much better DD 
options. In situations where different paths lead to such a broad range of performance and path optimization cannot 
be afforded, it is more appropriate to use a protocol based on path randomization, such as RPD. This scheme offers 
advantages also at long times, as it will be shown in Sec. VI. 

In the right panels of Fig. [H we also compare PDD and NRD during intra-cycle times, i„ = nAt. This may be of 
interest in situations where constraints on the number of pulses or control intervals make it unfeasible to close a 
complete cycle, for instance, when Tc becomes prohibitively long. The decline in the PDD performance for i„ < Tc 
followed by its recovering as i„ — > Tc refiects the fact that deterministic sequences are designed to perform well at the 
cycle completion, little being expected from them during intra-cycle times. Notice that up to half of the cycle, NRD is 
a protocol as good as, or even better than, the selected deterministic sequences. 

We have then verified the beneficial contribution of parallelism in DD sequences, which is increasingly pronounced 
as the group size grows. However, to "disentangle" the two effects and isolate the impact of \Q\ in deterministic vs. 
randomized schemes, examining protocols which have the same degree of parallelism is needed in principle - e.g., 
those derived from combinatorics [14| . The difficulty of such analysis, however, lies on the large system size required. 
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which makes numerical sim.ulations practically unfeasible. 

As a further remark, we call attention to the cycle time used in the figure: T^. ^ J^^ is one order of magnitude 
larger than the values determined by the convergence criterion kTc < 1. In the case of iV = 8, for instance, k ~ 20 J. 
This confirms that the criterion is overly pessimistic in practice, and values of Tc not necessarily complying with it 
may still lead to a substantial reduction of unwanted interactions in specific situations of interest. 

VI. RANDOMIZATION OVER EFFICIENT DECOUPLING GROUPS 

We now focus on addressing the long-time behavior of the protocols described in Sec. III. By long times we mean 
times where the analytical lower bounds are no longer reliable, T > (K^At)^^. Given the nearest-neighbor interactions 
under consideration, a very efficient DD group is now able to be identified, for which PDD always involves only 4 selective 
multi-qubit pulses irrespective of system size. Possible representations of the relevant control group for even N are: 

GxY = {1, ^1^3 • • • ^AT-i, XiY2X3Yi . . . Xn-iYn, Y2Yi . . . Fjv}, 

foz = {1, XiXj, . . .Xn-1, XiZ2X3Zi. . .Xn-iZn, Z2Z4. . . Zn}, 

GzY = {1, Z1Z3 . . . Zn-u Z1Y2Z3Y4 . . . Zn-iYn, Y2Yi . . . Yn}, (16) 

where, in each case, the rotation axis for odd qubits is perpendicular to the rotation axis for even qubits. Notice 
that, if desired, the same averaging effects may be obtained with DD groups which affect only even or only odd 
qubits. As an example, compare one of the pulse sequences derived from Gxy'- Pi = P3 ^ X1Y2X3Y4 . . .Xn-iYn 
and P2 — P4 = Y2Y4 . . .Yn, with a sequence acting only on odd qubits: Pi = P3 = Z1Z3 . . . Z^-i and P2 — Pi — 
Y1I3 . . . Iat-i, which is derived from Qodd — {1, XiXy, . . . X^-i, YiY^ . . . Iat-i, ZiZy, . . . Zn^i, }. Both lead to the 
same transformed Hamiltonians in each segment of free evolution, and therefore to the same results. 

The small size of the DD group simplifies the derivation of the leading terms in the AHT, which, in turn, help 
anticipating the long-time behavior of the protocols. In view of this, the strategy of this section is to first obtain and 
discuss the results for H'^^\ H^^\ and i?'^' analytically, and then validate the analysis with numerical simulations. 

A. Analytical results 

For clarity, we show here the results obtained for a system described by H§j^, and leave the case where the linear 
chemical-shift Hamiltonian is retained, H^.^j^^, to the Appendix. 

1. Lowest-order average Hamiltonian 

At Tn — riTc = 4nAi, first-order DD is achieved with any of the deterministic protocols, as discussed in Sec. Ill, 

5(°) ^ H,+H2 + H, + H4 ^ Q 

4 

At these times, for all randomized protocols except NRD, we also have, in the worst case, Hef/i'inAt) oc 0{At). 

2. First-order contribution to the average Hamiltonian 

Using Eq. P?)) . the first-order correction to the average Hamiltonian, H'-^^ = — i(At)^{[i?4, iJs] -|- [H4,H2] + 
[H4, H{\ + [H3, H2] + [H3, H{\ + [H2, Hi]}/i2T,), simplifies to 

H^'^ = -'^{[Hi,H,] + [H2,m]}, (18) 

whose result varies according to the group path selected. For each representation in Eq. (jTH]), the 4! available paths 
lead to the following six different results, 

N-2 

H^^^ = ±J^aAt J2 iY^X,+lZ,+2 + Z,X,+iY,+2) , (19) 
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N-2 



ij(i) = ij^cAi Y. (x,y,+iZ,+2 + z,y,+iX,+2) , (20) 

1=1 

Af-2 

ij(i) = ±J^M J2 {X,Z,+iY,+2 + Y,Z,+iX,+2) . (21) 

j=i 

Note that, in the three-body contributions appearing in H^^' , the direction a of the middle operator matches the 
direction of the interaction term cr^ (g) ciVi that most frequently (three times) changes sign within the interval 
[0,4Ai]. Therefore, for an anisotropic model with a > 1, paths that change the sign of the Ising term after every At, 
as in Eq. (HH, are preferable to those leading to Eqs. P^ and pp)) . as intuitively expected. 

In order to eliminate H^-^\ we may employ symmetrized sequences such as SDD, PCDD2, or PSCPD2. Specifically: 

• The SDD cycle consists of eight intervals of free evolution characterized by the transformed Hamiltonians in the 
following order. Hi, H2, H3, H4, H4, H3, H2, Hi - or (1234 — 4321) for short. The last four intervals correspond 
to a FDD sequence where 1 -^ 4, 2 ^ 3, 3 ^ 2, and 4^1, which inverts the sign of H^^^ in Eq. (|18p . leading to 
H^'^\2nTc) = 0. Equivalcntly, for SRPD, HeffiSnAt) ex O ((Ai)^). 

• PCDD2 is characterized by sixteen intervals of duration At, (1234 — 2143 — 3412 — 4321), which is also symmetric, 
ensuring H^^^inTc) = 0. Interestingly, half oi this sequence also leads to H^^\2nTc) = 0, since, according to 
Eq. p^ . we can change the sign of H^^^ by simply switching the order in the pairs: 12 -^ 21 and 34 -^ 43. 

• PSCPD2 is given by the sequence (1234 - 4321 - 4123 - 3214 - 3412 - 2143 - 2341 - 1432), so that after every 
eight intervals At we have H'^^\2nTc) — 0. 

3. Second-order contribution to the average Hamiltonian 

The three sequences given above do not cancel H'''^\ In fact, even higher levels of concatenation (or permutations) 
are still incapable of eliminating the second order term in the AHT, due to the sequence pre-determined structure. 
The same 4 (8) different paths employed in PCDD2 (or PSCPD2) are the only ones appearing also at ^ > 2 (m> 2), and 
whether alone or in rearranged combinations with each other, they cannot cancel H^"^' . This is to be contrasted with 
the sequence introduced in the end of this subsection, which incorporates a larger variety of group paths and does 
lead to H^'^'^ = 0. In order to better analyze the structure of H'^'^\ let us take advantage of Eq. p?)) and write 

H^^^ = -^{[(2i?i +^2), [Hi,H2]] + [i2H4 + Hs), [H4,Hs]]}. (22) 

Because H^''^ — for fc == 0, 1, to obtain H'-^^ at T„, we only need to sum H^^'' computed for each of the n intervals 
[0,4Ai]. It is straightforward to verify that H'-^^ obtained with a PDD sequence is identical to the one computed 
with its corresponding SDD, since the result for (1234) is equal to that for (4321). Furthermore, the symmetry of 
Eq. (|22p allows to simplify the computation of H^^^ for PCDD2 and PSCPD2. For the first, we need to evaluate H^^'' 
only for (1234) and (2143), whereas the latter requires the calculation of ^(2) f^j. (1234), (4123), (3412), and (2341). 
Notice that, up to third order in the AHT, the same results arc then obtained for this system with either PCDD2 or 
half oi this sequence. Even though if-^^ is the dominant term for SDD, PCDD2, and PSCPD2, the last two sequences 
lead to a significant improvement. This may be understood upon close inspection of a particular pulse sequence 
based on Gxy, characterized by the path {1, X1Y2 . . . Xn-iYn, X1X3 . . .X^-i, Y2Yi...YN} - which leads to 
Pi = P3 = X1Y2X3Y4 . . . X]sf_iY]sf and P2 = -P4 = Y2Y4 ■ ■ ■ Yn- The following exact results are found: 

{ N-2 N-2 

- J2 (^«^»+2 + YiY+2 ~ 2Z^Z,+2) - a(YiY2 + Yn-iYn + 2 ^ YY+i 
4=1 1 = 2 

, N-3 

+ -r 2_^ [XiXiJ^2 {2YiJ^iYi+^ — Zj+iZi+a) + YiYi^2 {2XiJ^iXi^^ — Z^+iZi+a) — ZiZij^2 {Xi+iXiJ^^ + yi+ili+a)] 
•^ i=i 



JV-3 ~j 

+2a 2_^ ZiXi^iXi+2Zi+3 > 
t=i J 
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PCDD2 : i/(2)(4r,) = -j\Atf2a I - ^ {Xa^+2 + Y,Y,+2 - 2Z,Z,+2) 



.3.^1 



+ -r ^^ [XiXi+2 (2i^i+il^i+3 — Zi-|_iZi-|_3) + l^ili+2 (2Xi+iXi+3 — Zi+iZi+^) — ZiZi+2 {Xi+iXi+^ + Fi+iFi+a)] 

PSCPD2 : H(2)(8T,) = +j3(Ai)2a <^ - ^ (Z,Z,+2 + Y^Y^+2 - 2X^X^+2) 

+ - 2_^ [ZiZi+2 i^Yi+iYi+s — Xi+iXi+s) + YiYi+2 (2^i+i^i+3 — Xi+iXi+3) — XiXi+2 (Yi+iYi+s + Zi+iZi+z)] 



3 

i=l ) 

The results vary slightly for other control paths (see the Appendix for a comparison between two possibilities), but 
the basic conclusion remains unchanged: the number of bilinear and four-body terms reduces when we switch from 
SDD to PCDD2 or PSCPD2. In particular, notice that, contrary to SDD, the bilinear terms in PCDD2 and PSCPD2 involve 
only next-nearest-neighbor interactions. 

In the case of H§j^^j^, where both linear and bilinear terms need to be taken into account, the outcomes for H''^\ 
k = 0,1,2, become strongly dependent not only upon the group path, but also on the representation chosen, as 
demonstrated numerically in the next subsection and analytically in the Appendix. 

4- Effect of group reducibility 

It is insightful to contrast the results of CDD and SCPD obtained here for the spin chain described by -ff^vw with the 
case of a single qubit subject to a magnetic field of unknown direction, described by the Hamiltonian Ho = B ■ a. In 
both problems, the DD group consists of four elements, however for H^j^ the group action on the system's Hilbert 
space is reducible, whereas for the single qubit it is irreducible. In the latter case, the irreducible decoupling group, 
G = {t,X,Y,Z}, is able to substantially decrease the power of At in the average Hamiltonian for higher levels of 
concatenation and permutation. The table below [78!| summarizes the order of H for the first four levels: 

Isolated Single Qubit 



Level 


PCDDi 


PSCPDm 


1 
2 
3 

4 


0{{At)) 
0{{Atr) 
0((Ai)23) 
O((At)80) 


0({Aty 
oUAt)^ 
OUAtf 
0((Ai)5 



As the level of concatenation increases, a superpolynomial convergence is verified, establishing CDD as the best per- 
former for this system. For a single qubit coupled to an environment, the results depend fairly sensitively on the pure 
bath Hamiltonian [111 . Il2l | , which is renormalized by the control action [30| and whose interplay with the system-bath 
coupling terms is responsible for determining the final convergence rate. Still, provided that the environment dynam- 
ics is sufficiently slow, it has been verified that among the proposed protocols, CDD remains the method of choice in 
the presence of generic single-qubit errors 29|, |30|, 179| . 

Having spelled out the advantages and limitations of CDD and SCPD, we now proceed to describe possible strategies 
to further improve protocol performance: 

• One option, which is especially relevant for reducible DD groups, as in Eq. p^ . consists in truncating CDD 
and SCPD at the first level beyond which no further improvement is verified {£ = 2 and m=2 in the system 
under investigation), and then embedding the resulting periodic sequence with random pulses derived from 
an irreducible group, such as the Pauli group G^ = ®iGi- This way, the remaining terms in the effective 
Hamiltonian may still be reduced. 

• Another alternative is to take into account a larger number of group path realizations, and combine them into a 
supercycle sequence which, besides H'^^^ and _ff'^^\ also cancels H^'^\ This may be achieved, for instance, with 
the sequence (1234 — 2143 — 2314 — 3241 — 3124 — 1342) - see description below. Once the appropriate sequence 
has been found, we may again exploit randomization and embed the supercycle with random pulses. 

• Clearly, we may seek sequences which eliminate additional higher-order terms, although there may be in general 
some disadvantages associated with this: (i) the sequences may become much longer, and therefore harder to 
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implement; (ii) searching for them may become very demanding, especially when dealing with complex systems 
and larger DD groups; (iii) in real settings, pulse errors need to be taken into account, which further significantly 
increases the complexity of the search problem. 

5. Supercyde sequence: H'-°'> = H'-^^ = H'-'^'' = 

In NMR, WAHUHA-based-supercycle sequences which are capable of eliminating dipolar interactions up to third 
order have long been devised [ij. A simple approach consists in combining three WAHUHA sequences cyclically 
permuted ^80]. In our case, however, permutations of the basic path (1234) are not sufficient, and more group 
path realizations are required. Indeed, H^''^' — 0, k — 0,1,2, may be achieved, for instance, with the sequence 
(1234 — 2143 — 2314 — 3241 — 3124 — 1342). Notice that each eight intervals of this scheme corresponds to a different 
half-PCDD2, which guarantees that H^''^2Tc) = for fc = 0, 1. Furthermore, by using Eq. ([22|) and adding the results 
for H^^'' obtained with each of the six PDD sequences contained in the supercycle, we arrive at the desired result, 
Hy^>{QTc) = 0. Thus, at every T„ = 6nTc, the first three terms in the average Hamiltonian are simultaneously 
canceled - leading to better averaging than for CDD or SCPD obtained in a cycle time even shorter than for PSCPD2. 

In terms of pulses, this sequence, which we will refer to as H2 henceforth, translates into: 

U,{&T,)^Pa{PcPaPc)Pb{PcPaPc)Pc{PbPcPb)Pa{PbPcPb)Pb{PaPbPa)Pc{PaPbPa). 

where, for any path from Eq. (|16p which starts with the identity, that is, {1, ffi, <?2,53}, we have Pa — gi — gsgli 
Pb — 323i = 53, and Pc = gsgl — 52- Notice that the two axes of rotations involved in the basic first-order-DD 
sequences change every 8At, and the direction appearing at every 4Ai alternates according to the following rule: it 
starts with Pc, is followed by Pb, and is finally Pa, being then repeated. This is to be contrasted with PSCPD2, where 
Pc does not appear, 

(1234 - 4123 - 3412 - 2341) => Ud^T,) = l{PBPAPB)t{PAPBPA)t{PBPAPB)t{PAPBPA) , 

and with PCDD2, where Ci is fixed, Pc is the only rotation appearing in between two Ci's, and only Pa or Pb appear 
between C2's. 

C2 ^ C/c(4r,) = l{Ci)Pc{Ci)l(C,)Pc{Ci), 
C3 ^ Uc{16Tc) = Pb{C2)Pa{C2)Pb{C2)Pa{C2). 

B. Numerical results 

We validate the previous analytical analysis by studying a A^ = 8 qubit system described by Eq. ^2]) . subject to 
selective DD pulses derived from Eq. [161 Whenever appreciably different, the results are also contrasted with those 
obtained for the cubically decaying Hamiltonian given by Eq. (fT3|) . Notice that in the latter case, DD sequences have 
been developed based on generalized Hadamard matrices [14| , which may also be written in a group form as presented 
in [4J. For N = 8 qubit, a possible representation is given by 

08 = { 1, Z3Z4Y5YQXjXg,, Z2Y3X4ZqYjXs, ^2X3^415X6^7, 

Y2Y4X5ZqXiZs, Y2Zj,X4Z^XqYq, A2 13^4X5^718, A2A3Z5I6I7Z8} . 

1. Averaging of bilinear couplings: Isotropic system 

We first focus on the bilinear interaction terms alone, as in Eq. (J14p . with the main goal of comparing determin- 
istic and randomized protocols at long evolution times, where the advantages of the latter are predicted to become 
important. 

As an initial illustration of the fast accumulation of errors occurring in periodic deterministic schemes, in the left 
panel of Fig. [2] we assume a PDD sequence and contrast the data acquired at intra-cycle times, i„ = nAi/5, with data 
obtained only at the completion of each cycle, T„ — AnAt. The intra-cycle curve oscillates in time. At short times, 
the peaks in performance coincide with the instants of cycle completion, but as time evolves these two values become 
progressively detuned. This effect becomes more pronounced at longer times and for larger values of At, indicating 
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FIG. 2: Long time behavior of DD sequences based on Qzy GS]) and applied to -ffjvjv jMl with TV = 8, a = 1, and At — 0.1J~ . 
Left panel: PDD sequence. (Black) Curve; data acquired at intra-cycle times, t„ = nAt/5; (red) crosses: data at Tn = 4nAi. 
Right panel: Deterministic vs Randomized DD schemes. Data acquired at T„ — 4nAi. Average fidelity over 10^ control 
realizations. 



that best performance is not necessarily achieved at T!„, and suggesting that repeating the same sequence after every 
cycle time may not be the best strategy. 

We next proceed with a quantitative comparison between protocols. While different ways for effecting such a 
comparison are conceivable, the most natural choice for contrasting cycUc and acyclic schemes is to fix the interval 
between consecutive pulses, implying that higher levels of concatenation and permutation may need longer times to 
be reached. Data is acquired after every T„ = 4nAt, which for some of the protocols provides information about the 
performance in between their defining inner sequences. In the case of H^j^ and for times r„ > 30Tc, we find, in 
increasing order of performance: NRD, PDD, SDD, EMDi, CDD, EMD2, RPD, SCPD, SRPD, EPCDD2, and EPSC PD9 . Since, with 
the exception of permutation-based protocols, these results have been already partially presented in [44!, Sal, we limit 
ourselves to showing in the right panel of Fig. [2] the two best deterministic schemes and the three best randomized 
protocols, briefly commenting on the others in what follows. 

(i) NRD shows the poorest performance, consistent with the fact that the DD group is now very small and all 
protocols involve simultaneous rotations. 

(ii) PDD is unaffected by the representation or group path selected, whereas for i^^;,, different choices lead to a range 
of different results, which broadens as |C/| increases. Such a dependence also affects SDD and randomized protocols 
where the inner code is based on a fixed pulse sequence, such as EMD. 

(iii) EMD2 outperforms EMDi, which is not surprising given that the former involves an ensemble of 4^ random pulses, 
whereas the latter has only 4. A comparison between RPD and EMD2 is more subtle, due to the interplay between 
three factors: available repertoire of random pulses, chances for symmetrization being achieved at T„ = 8nAi, and 
sensitivity to path selection. For the A^A^-isotropic system, RPD shows the best performance, while for iJ^^j, specific 
path choices of the inner code lead to superior performance of EMD2 - see Ref. [4J]. In general situations where 
significant performance spread exist with respect to control path, even though superior EMD2 sequences may exist, 
searching for them becomes demanding when |t/| is large, which justifies the use of RPD as a practical choice. 

(iv) As seen in the right panel of Fig. [21 SRPD surpasses first CDD and then SCPD at sufficiently long times. In 
contrast, for the system described by -ff^^ with same Tc value, CDD is found to decay slower, being surpassed by SRPD 
only at T > 48Tc (see Fig. 2 in [11]), whereas SCPD is outperformed by SRPD already at T > ATc. Still, the fact that 
such a simple sequence as SRPD may outperform more elaborate deterministic methods such as CDD and SCPD vividly 
exemplifies the advantages of randomization. 

(v) The periodic sequences PCDD2 and PSCPD2 embedded with pulses randomly picked from Q^ perform better than 
SRPD. In Fig. [3l we show the dispersions around the mean value for each of the three random schemes: as expected, 
they all broaden at longer times. The best protocol, EPSCPD2, exhibits also the narrowest dispersion. Therefore, by 
combining randomization, symmetrization, and permutation, a DD scheme which is still relatively simple and yet 
efficient may be created. 
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FIG. 3: Deterministic vs randomized DD schemes based on Qzy (|16p applied to -ff^jv Gil) with N = 8 and a — 1. Data 
acquired at T„ = 4nAf, At = 0.1J~^. Average fidelity over 10^ control realizations. 



2. Sequence optimization 



Up to this point, the recipe we have been used to develop better DD protocols has consisted of deriving a first-order 
DD sequence, (PDD) from AHT, and then improving it by exploiting deterministic strategies and randomization. We 
now address an alternative numerical approach to design high-level protocols. When creating algorithms to search for 
efficient protocols, the freedom in terms of types of controls (axis and angle of rotation), number of qubits affected 
at each step, and values of intervals between pulses is enormous, and taking all of these factors into account would 
make the analysis intractable. Thus, in line with what we have done so far, we restrict to ideal selective pulses drawn 
from the sets in Eq. (flB)) . and separated by a fixed interval Ai. 

The algorithm we propose may be described as follows. At every r„, we search among the |t/|! = 24 different group 
paths the one which leads to the largest value of {{F^)) at T„+i; the best sequence from T„ to T„+i is then stored, 
and the same search procedure is iterated for the next intervals, so that the sequence is built up piece by piece. The 
resulting sequence is named ALGOR and is shown in the right panel of Fig. [2l In a sense, this method shares some 
similarities with popularly employed genetic algorithms 8l|, |82| . Here, the entire domain depends on the final time 
r„, consisting of (|5|!)" different individuals. However, instead of randomly generating an initial population from this 
entire range of possible solutions, our initialization is based only on the set of |t/|! paths for the interval [0, |fJ|Ai]. 
For each new interval [T„, T„_|_i] and with reference to the same population of \Q\l paths, a new generation, bred from 
the best sequence for [r„_i, T!„], is selected. The fitness function corresponds to {{F^ (Tn+i))) : it strongly depends 
on time as well as on the previously selected ancestors. 

Below, we show the structure of the first 72 intervals of free evolution for the optimized pulse sequence obtained 
with the parameters of Fig. [2| 



(1234 - 2143 - 2314 - 3241 - 3124 - 1342) 
(4312 - 4213 - 1423 - 4132 - 2431 - 3421) 
(4231 - 2413 - 4123 - 4321 - 3412 - 1432) 



(23) 



Observe that the first line corresponds to the scheme H2 already discussed in Sec. VI. A. 5. Each of the two additional 
lines in Eq. ([23| also individually leads to the cancellation of H^^\ H^^\ and H'^^K The third-order decoupling is 
one reason for the significant improvement of this sequence when compared with the others in Fig. [51 Another very 
important, and related, contributing factor is the uninterrupted variation of the control path at every r„ = AnAt. 
Notice that up to T = 72 Ai, 18 different control paths are used. This is to be contrasted with CDD (SCPD), where, 
for any level of concatenation (permutation), only 4 (8) different paths can be employed, variations being associated 
only with the order they are arranged. 

Frequent path alteration is at the heart of methods employing path randomization, which makes it worth to 
further scrutinize the behavior of simpler sequences, whereby we use randomization on top of sequences achieving 
ff(°)(24nAt) = ff(^)(24nAi) = //(2)(24nAi) = 0. Another motivation for this analysis is the fact that the algorithm 
proposed above clearly becomes unfeasible for large DD groups. In such cases, turning to simpler alternatives becomes 
a necessity. Let us then select the first line in Eq. ((23l) and create three new protocols: (a) a deterministic scheme where 
the 24 free intervals are periodically repeated (PH2); (b) a randomized scheme (RH2), where the path for the interval 
[24nAt, 24nAt -I- 4Ai] is picked at random and the subsequent interval [24nAt -|- 4At, 24nAt -I- 24Ai] is rearranged 
so that at 24(n + l)Ai, the three terms, H'^^\ H^^\ and ^^^^ cancel; (c) another randomized scheme, (EH2), where 
the first line is used as an inner code to be embedded with random pulses from Q^ . These protocols are compared in 
Fig. H with ALGOR and EPSCFDa. 
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FIG. 4: DD schemes that guarantee H^°'> = Jf'^' = if^^' = at 24nAt and EPSCPD2. System described by H§f^ with iV = 8 
and a — 1. Data acquired at T„ = 4nAf, Ai = 0.1J~^. Notation: ALGOR - sequence obtained via the numerical algorithm 



explained in the text; PH2 - periodic sequence; 
Average fidelity over 10^ control realizations. 



EH2 - embedded sequence with random pulses from Q ; RH2 - random path. 



Notice that the inner code of the two new randomized sequences is shorter than that for EPSCPD2, yet they perform 
significantly better, EPSCPD2 being cfoser in performance to the deterministic scheme PH2. Interestingly, at very long 
times, RH2 and EH2 outperform even the ALGOR sequence. It is therefore clear that the algorithm used here cannot 
identify the best scheme for very long times, the reason being the extreme sensitivity of pulse sequence performance 
to the final time. Take, for example, two instants of time Ta and Tg, with Tb > Ta- The sequence that leads to the 
best result at Ta is not necessarily the beginning of the one giving the best result at Tg. The algorithm employed 
looks for the best future pulses to be added to the paths that were already selected and which cannot be further 
altered. The randomized sequences, on the other hand, have in storage realizations that may be worse than the ALGOR 
scheme at Ta, but which will contribute to better realizations at Tb- 



3. Linear terms and anisotropy 



Attention so far has focused on averaging out the bilinear terms of an isotropic system. As a next step, we consider 
-^z+NN' ^y taking into account one-body terms and the effects of anisotropy. As a main feature, deterministic 
schemes (and by extension randomized schemes employing fixed inner codes) turn out to be strongly dependent upon 
the selected representation and control path. In such conditions, protocols based on path randomization become more 
advantageous for two main reasons: First, even though they need not lead to the best results, they ensure robust 
behavior against path variations; Second, it may simply be too demanding to find the best control path when dealing 
with large \G\- We shall compare two deterministic protocols, CDD and SCPD, with SRPD in the presence of anisotropy 
and linear Zeeman terms characterized by Si. The effective Hamiltonian for these three schemes is of ^^((Ai)^), 
and exact analytical results for the second-order contribution H''^'' of the deterministic protocols are provided in the 
Appendix. 

Let us start by investigating the additional effects of the one-body terms in H^,j^jy. The selective pulses are now 
drawn from QxY in Eq. (|16p. since Gxz and Qzy do not cancel linear terms. Two paths are examined: 



Path 1 : {1, X1Y2 . . . Xn-iYn, X1X3 . . . Xn-i, Fa^ . . . Fat}, 
Path 2: {1, XiX3...Xn-i, X1Y2 . . .Xn^iYn, Y2Yi...YN}. 



(24) 



For 5i > Ja, based on Eqs. ([201), (|M|, (IM|) . (|^ . we expect Path 1 to be the best choice for PDD, SDD, and CDD, 
whereas Path 2 is more suitable for SCPD. This is demonstrated numerically for CDD and SCPD in the left panel of 
Fig. [SI where an isotropic system, a = 1, is considered. Once again, the randomized scheme, SRPD, surpasses the 
deterministic protocols at sufficiently long times. In the case where Si < Ja, the competition between a and Si 
complicates the selection of the best control path, which encourages the use of randomized-path schemes. 

In order to isolate the effects of the anisotropy, we discard the one-body terms and return to H^j^, but this time 
with a 7^ 1. As indicated by Eqs. (HOI), (EB), jM]), and (|X4l) . PDD and SDD are expected to perform better for Path 2, 
which is somehow intuitive, since this is the path that changes the sign of the Ising term more frequently. Predictions 
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Deterministic vs randomized DD schemes based on Qxy (|16() applied to -ff^jv a-nd -ffz+Arjv, N = 8. 



FIG. 5 

at every Tn = 4nAt 

line. Left panel: H^ 



At = 0.05J . For deterministic schemes: solid line - Path 
v; Q = 1. Qubits distinguished by chemical shift: Si — lOJ, i- 



1; dashed line - Path 2. 



Data is acquired 
SRPD: dot-dashed 



-odd; Si = 0, i-even. Right panel: H 



Z+NN 



and Of = 5. Average fidelity over 10^ control realizations. Notice that the value of At considered here is half the one used in 
Figs. [21 13] refiecting the fact that, in the presence of Si and for a > 1, fidelity decays faster. 



of this sort become less trivial when dealing with CDD and SCPD, since H^"^^ for the interactions alone is very similar for 
both paths. Therefore, the identification of the best path for these protocols requires either precise knowledge about 
the system and tedious computations of higher-order terms in the average Hamiltonian, or a numerical search over 
an ensemble of realizations. Both options may be avoided if instead we employ a randomized protocols such as SRPD. 
In the right panel of Fig. [5l we compare the paths from Eq. (f24|) for an anisotropic system controlled via CDD, SCPD, 
and SRPD. In stark contrast to PDD and SDD, CDD and SCPD perform better for Path 1. Notice also that CDD appears 
to be more robust than SCPD against path variations; still, as before, they are both surpassed by SRPD at long times. 
Overall, the following conclusions may be drawn: Various analytical and numerical strategies exist or may be devised 
to improve the performance of deterministic protocols. However, DD may always benefit from randomization in terms 
of: pulse sequence simplification, robustness to path variations, and slower accumulation of residual averaging errors. 

VII. PULSE IMPERFECTIONS 

Throughout the previous analysis, we have assumed perfect control resources, implying, in particular, the ability 
to effect perfect instantaneous pulses. In practice, attainable control operations are far from ideal, a variety of 
both systematic and random imperfections contributing to deteriorate protocol performance. Systematic errors, 
in particular, may be especially harmful at long times, since their effects tend to be cumulative. Depending on 
implementation detail, different control non-idealities may be relevant [1], including: finite-width effects; deviations 
from the intended rotation angles, which may in turn be common to all pulses or different for different sets of controls; 
phase errors, arising from the fact that the phases of different pulses are not necessarily in quadrature; phase transients 
associated with control switching. By way of illustration, we focus on analyzing how DD performance is affected by 
pulses of finite duration and fiip-angle errors. The three protocols with effective Hamiltonian of ©((At)^), CDD, SCPD, 
and SRPD, are selected for such investigation, some discussion about PDD being also presented. The case of a system 
described by ^^^v i^ explicitly considered, with DD pulses being drawn from Qzy- 



A. Finite pulse widths 



In realistic control settings, the power fi is not infinite nor is the pulse duration r equal to zero. As a first 



approximation, pulses may be assumed to have a rectangular profile (for shaped pulses see e.g. Refs. [83ll84ll85ll86l|). 
and phase transients associated with the instants they are turned on and off [Ij may be disregarded, so that the 
desired rotation angle is simply determined by the product (3 = fir. 

In the presence of finite pulses, first-order DD is no longer achieved. Instead, after the completion of the first PDD 
cycle, we find 
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which cancels only in the limit r/Ai ^ 0. This is to be contrasted with the WAHUHA sequence, where first-order 
DD may still be achieved by properly adjusting the rotation angle according to r/Ai [l|. In our case, depending on 
such a ratio, small deviations from (3 = n lead simply to hardly perceptible improvements on the results for Ff-{Tc), 
as shown in the top panels of Fig. [S] To justify this improvement, higher-order terms in the average Hamiltonian are 
needed, since it is most probably caused by the interplay between these terms and H^^' . 
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FIG. 6: Ideal vs finite- width pulses for deterministic and randomized DD schemes derived from Qzy (|16() . System described 
by H§f^ with Af = 8, Q = 1, and At = O.IJ"^ Top panels: ((Fe^(Tc))) vs P/tv. Bottom panels: Decay in time for {{F^)) 
for 13 = -K. Left panel: t — Q. Middle panels: r = 0.005J~^. Right panels: r = 0.01J~^. Average fidelity over 10^ control 
realizations. 



Since, for the values of r/Ai considered here, the improvement in fidelity obtained by varying /3 is negligible, 
in the bottom panels of Fig. [6] we simply fix /3 = tt and compare PDD, CDD, SCPD, and SRPD. Similarly to Fig. [21 
SRPD outperforms the deterministic schemes at long times. However, SRPD deteriorates faster with finite pulses than 
the deterministic schemes. As a result, for very large errors, of the order of t/ At > 10%, the gain achieved with 
randomization is offset by the errors and the performance of SRPD becomes comparable to that of SCPD. 



B. Flip angle errors 



Flip angle errors may be caused by power misadjustment in the pulse generator, variations of the transmitter power 
output, or radio-frequency inhomogeneities [l|]. Here, we focus on systematic flip angle errors which are common to 
all pulses. This corresponds to a small over-rotation e of the intended 7r-pulses, and is described by 



exp[ 



-in{l + e)cr|''V2] = -lsin(e7r/2) - icrf ' cos(e7r/2) 



(26) 



In the top panel of Fig. [71 we consider an over-rotation of 1% (which is relatively larger than what may be found for 
instance in typical resonance experiments [ij, Is^), and compare SCPD, CDD, and SRPD. As in the case of ideal pulses, 
CDD is outperformed by SRPD, however the crossing between SCPD and SRPD is no longer verified. This may be better 
understood by observing the middle panels, where we show the difference D{e) between the fidelity obtained with 
ideal and with faulty DD pulses, {{Ff))^=Q and {{F^))^, respectively. Interestingly, errors may contribute favorably 
to the performance of deterministic schemes, as indicated by the negative values of D{e). When e = 0.01, the 
improvement for CDD is modest and occurs at intermediate times, while SCPD shows a significant increase in fidelity at 
long times. Contrary to that, flip-angle errors have always a detrimental impact on randomized schemes. Therefore, 
the accumulation of high-order terms of the average Hamiltonian in SCPD is counterbalanced with the positive effects 
caused by errors e ~ 0.01, while the advantages of randomization in SRPD cannot compensate for the sensitivity to 
pulse imperfections, resulting in the worse performance of the latter. 

As a further illustration of the effect of flip-angle errors in deterministic schemes, we show in the bottom panel the 
effects of e on PDD. At very short times, e enhances the fidelity decay, while this situation is reversed at longer times. 



22 



D{e) 




FIG. 7: Deterministic vs randomized DD derived from Qzy- System described by H§i^, N = 8, a = 1, and At = O.IJ ^ . Top 
panel: e = 0.01. Middle panels: 1% and 2% stand for D(O.oi) and D(0.02), respectively. Bottom panel: D(e) for e = 0.01 
(black solid line) e = 0.02 (red short-dashed line), e = 0.03 (green long-dashed line), and e = 0.04 (blue dot-dashed line). 
Average fidelity over 10^ control realizations. 

Contrary to SCPD and CDD, where e > 0.01 mostly worsens protocol performance, a consistent improvement of PDD at 
longer times is observed for errors up to e ^ 0.03. 

In short, even though deterministic protocols appear to be more protected against finite width and flip-angle errors 
than randomized schemes, in the case of relatively small errors the advantages of randomization at long times are still 
dominant. From this perspective, a promising next step may arise from combining randomized with bounded-strength 
Eulerian design [lO.] , which is explicitly intended to compensate unwanted evolution during pulses and offer enhanced 
fault-tolerance. 



VIII. CONCLUSIONS 



A. Summary 



We have developed a quantitative comparison between deterministic and randomized DD protocols in closed sys- 
tems described by a time-independent Hamiltonian, confirming the advantages of randomization at long evolution 
times and the efficiency of control protocols which combine multiple decoupling strategies - such as randomization, 
symmetrization, concatenation, and cyclic permutations. We have also argued how the search for better deterministic 
sequences in a large set of possibilities may be shortcut by using randomization to develop simple, yet very efficient 
protocols. While the main emphasis has been on removing bilinear interactions in a spin-l/2-particle-system with 
isotropic NN couplings, a number of results in the presence of anisotropic couplings and one-body terms have also 
been established. Furthermore, a comparison between DD results for NN and for long-range cubically decaying 
interactions has been included. Two types of DD groups have been considered: an inefficient group whose size in- 
creases exponentially with the system size, and may be easily extended to systems with long range couplings; and a 
very efficient group, which leads to only four simultaneous pulses and is specifically designed to systems with NN 
interactions. 

In the case of inefficient averaging, we have shown that different paths to traverse the DD group lead to a broad 
range of results, where PDD sequences involving collective rotations tend to perform better than those consisting mainly 
of single rotations. For large groups, the selection of the best deterministic protocols becomes very demanding, which 
favors protocols that average over various possibilities, such as NRD. One step further consists in applying RPD, which 
already pre-selects the most efficient pulse sequences to be included in the average. Additionally, we have showed 



23 

that in situations where the DD group is so large that a single cycle can be hardly completed, the performance of NRD 
is similar to the best PDD performance. 

The small number of pulses involved in efhcient DD schemes has allowed for a thorough analytical study. This has 
offered insight into understanding why different paths and group representations do not affect DD performance in the 
case of isotropic NN couplings, and also to partially predict the best control choices in the presence of anisotropy and 
one-body terms - which have been next numerically validated. Most importantly, the analytical results have shed light 
on the reasons for the limited performance of concatenated protocols (and protocols based on cyclic permutations) in 
the class of systems under consideration, and paved the way to the development of a better control sequence able to 
decouple interaction up to third order, at least. The key idea has been to access more path realizations than those 
available to CDD and SCPD, and yet rearrange them such that the structure of half-PCDD2 was kept. 

Numerical simulations have served a twofold purpose: to confirm and extend the analytical predictions; and to 
identify the best randomization strategy. While randomization is unquestionably advantageous at long times, whether 
it is better to embed a deterministic sequence with random pulses or to apply path randomization strongly depends 
on the system at hand. If the inner code varies significantly with the path, and the search for the best option is 
demanding, path randomization always proves more adequate. 

Along with the numerical analysis, we have also proposed an algorithm to search for new DD schemes. This 
has resulted in an extremely efficient pulse sequence based on frequent path alteration. Interestingly, however, this 
sequence turned out to be outperformed by a very simple scheme which combined the initial pulses from the algorithm 
sequence with randomization. The main take-away message is that even though an optimal deterministic sequence 
may always exist for a particular system at a specific final time, identifying it may be beyond reach, in which case 
resorting to simpler, yet efficient randomized sequences becomes a practical method of choice. 

At last, the effects of two control non-idealities - finite width pulses and flip-angle errors - have been quantified. 
Deterministic protocols appear to be better protected against such imperfections, although the relative gain due 
to randomization still dominates if the errors are relatively small. A complete analysis of fault-tolerance requires, 
however, consideration of additional compensation mechanisms along with randomization, which we intend to address 
elsewhere. 

B. Outlook 

The selection of an adequate DD protocol ultimately depends on details about the system and the control objective 
to be achieved. A sequence like PCDD2, for example, is excellent to decouple a single qubit from its surrounding 
bath 29, 30., 79], but performs poorly at freezing evolution in a spin chain with NN interactions. Similarly, the 
WAHUHA sequence combined with cyclic permutations lead to third order DD of the dipolar Hamiltonian [l| , whereas 
PSCPD2 is unable to cancel /f'-^-' in HffN- When the control pulses aim at complete refocusing, the total time during 
which information needs to be stored is a decisive factor in the choice of a protocol. By comparing the evolution 
times in Figs. [2] and 01 for example, one sees that SRPD is a good enough method in the first situation, although 
not worth consideration in the latter. Another important consideration stems from the desired control goal: the 
removal of unwanted evolution, independent of the choice of initial state, as addressed here by analyzing the decay 
of entanglement fidelity; or the preservation of a specific, known initial state. The latter scenario may allow for the 
development of dedicated pulse sequences ensuring yet better performance - as exemplified by long-time coherence 
saturation effects observed in both NMR spin- locking experiments [l|) and in quantum information storage [2^, [30, UM ■ 

Throughout this work, the time interval between consecutive pulses in PDD has a fixed value At, while for other 
protocols actual rotations may be separated by some integer multiples of At. If this constraint is relaxed, so that 
consecutive rotations may be arbitrarily spaced, substantial freedom is added in principle to DD design. In this sense, 
the existence of optimized sequences for specific control settings, as in [131 , clearly points to the potential of unevenly- 
spaced sequences for higher-order DD. The analysis and combination of multiple control time scales and different 
various angles of rotation is clearly an issue which deserves additional exploration in the context of randomization, 
along with the identification of a QIP platform which may be suitable to experimentally test some of the benefits 
predicted for randomized coherent control. 
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APPENDIX A: DOMINANT TERMS IN THE AVERAGE HAMILTONIAN 

Here we consider the general Hamiltonian with NN interactions, H^^^^j^ (flS)) . where both anisotropy (a 7^ 1) as 
well as one-body terms may be present, and provide explicit results for the first three contributions to the average 
Hamiltonian in the case of deterministic protocols. 

1. Lowest-order average Hamiltonian: //' ' 

Representations Qxz and Qzy (|16p . which involve group elements in the z direction and also representations 
affecting only half of the qubits cannot cancel all one-body terms. If complete refocusing of the Hamiltonian is the 
goal, QxY is the representation to be used, for it guarantees H'^^^Tc) = 0. Let us consider two particular pulse 
sequences characterized by the following paths: 

Path 1 : {1, X1Y2 . . . Xn-iYn, X1X3 . . . Xn-i, Y2Yi . . . Yn}, 

Path2: {t, X1X3...XN-1, XiY2...XN-iYN,Y2Yi...YN}. (Al) 

2. First-order contribution to the average Hamiltonian: H^^' 
For PDD sequences from Gxy that change the sign of the Ising interaction after every Af , such as Path 2, we find 



ij(i)(Tc) = T^At 



/ . 5 it^i+i + 2_^ ^ A.,ri+i 



N-2 



± •/ Ai 2_^ {XiZi+iYi+2 + YiZi+iX,, 



i+2) , 



i=l 



(A2) 
while for Path 1, Eq. (P0|) still holds. The interplay between anisotropy and qubit frequencies becomes now a 
determining factor in the selection of an appropriate group path. 

3. Second-order contribution to the average Hamiltonian: H^'^' 

We compute H^^'' for the two pulse sequences of Gxv in Eq- (|Aip . The following results arc found for SDD, PCDD2, 
and PSCPD2. Notice that for reasons explained in Sec.VI.A, H^'^\2Tc) for SDD equals H'^^\Tc) for PDD. 

Pathl : (A3) 

C / JV-2 \ ^-3 >| 

SDD : ij(2) =--La-2JAlD, + Q,-al Y^Y2 + Yx^iYx + 2 ^ F,y,+i 1 + 2a ^ Z,X,+iX,+2Z,+3 \ 

PCDD2: H^^^ =-^La-2JA{D, + Q,) 
PSCPD2 : iT(2) ^ ^^^ ^ j^p^, ^ g^) 



Path 2 : (A4) 

SDD : ij(2) = (Ai)2 i {5i + 62)Zi + Yl (^»-i + 2'5« + S.,+i)Z, + {5n-i + 5N)ZN-odd \ 

I i-odd J 
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4(M' 



J2 iS^ + S^+l)S^+lY,Y^+l + ^ {5i + 6i+i)6,YiYi+i 
—odd i — even 

l^ 2— even ) 

{ I ( ^~^ \ 2 ^^^ 

2JAId, + Q,--[ Y1Y2 + Yn-iYn + 2 Y, y^Y^+l + - II X,Z,+iZ,+2X.. 



a 

i=l 



PCDD2 : ^(2) ^ __^^ _ 2JA{D, + Q,) 

PSCPD2 : i/(2) ^ ^^^ ^ j^(^^ ^ g^) 

6 

where the following quantities have been introduced: 

A = J^{Atfa, 
La ~ 2_^ K*^* ^ Si-^i)YiYi+iZi+2 — {Si+i — Si+2) ZiYi+iYi+2] , 

i— odd 

+ 2_^ [('^« ^ Si+i)XiXi+iZi+2 — i^i+l — (^i+2)ZiXi+iXi+2] , 
i— even 

ib = - ^ [i'2Si + Si+i)YiYi+iZi+2 + {Si+1 +25i+2)ZiYi+iYi+2], 

i— odd 

+ _2^ [{Si + 2di+i)XiXi+iZi+2 + {2Si+i + Si+2)ZiXi+iXi+2] , 



z— even 

N-2 



2 ^"^ 

-Da; = -T 7^ (^1^^1+2 + ZiZi+2 — 2XiXi+2), 



Qz ^ -;; 2_^ [XiXi-i-2 {2Yi+iYi+3 — Zi+iZi+3) + YiYi+2 {2Xi+iXi+3 — Zi+iZi+3) — ZiZi+2 (Xi+iXi+s + Yi+iYi+3)] , 

^ Af-3 

■T 2_^ [YiYi+2 (22'j+iZi+3 — Xi+iXi+3) + ZiZi+2 (21^i+lli+3 — Xi+iXi^^) — XiXi+2 (^i+l^i+3 + ^i+l'^j+s)] • 
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